Docket No.: 07844-437001 



WHAT IS CLAIMED IS: 

1. A computer program product, stored on a machine-readable medium, comprising
instructions operable to cause a programmable processor to: 

search a document for one or more unambiguous words, where unambiguous words 
are words that do not contain an ambiguous typesetting placeholder; 

automatically add the one or more unambiguous words to a dictionary; 

search the document for one or more ambiguous words, where ambiguous words are 
words that do contain an ambiguous typesetting placeholder; and to 

use the dictionary to resolve the one or more ambiguous words by resolving the 
ambiguous typesetting placeholders occurring in each ambiguous word. 
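
The first three steps of claim 1 can be sketched in Python as follows. This is only an illustration under stated assumptions: the `~` marker for an ambiguous placeholder, the lower-casing of dictionary entries, and the names `build_dictionary` and `find_ambiguous` are hypothetical and do not come from the claims.

```python
AMBIG = "~"  # hypothetical in-word marker for an ambiguous placeholder (assumption)


def build_dictionary(words, dictionary=None):
    """Collect the words that contain no ambiguous placeholder."""
    # Start from an empty dictionary unless an existing one is supplied.
    dictionary = set() if dictionary is None else set(dictionary)
    for word in words:
        if AMBIG not in word:
            dictionary.add(word.lower())  # lower-casing is an assumption
    return dictionary


def find_ambiguous(words):
    """Return the words that contain at least one ambiguous placeholder."""
    return [word for word in words if AMBIG in word]


words = ["cooperate", "well-known", "co~operate", "well~known"]
dictionary = build_dictionary(words)
ambiguous = find_ambiguous(words)
```

Here the dictionary is built only from the document's own unambiguous words, and the ambiguous words are set aside for the resolution step.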

2. The computer program product of claim 1, wherein the instruction to automatically
add the one or more unambiguous words to a dictionary comprises instructions to add the one or
more unambiguous words to an initially empty dictionary.

3. The computer program product of claim 1, wherein the instruction to automatically
add the one or more unambiguous words to a dictionary comprises instructions to add the one or
more unambiguous words to a dictionary containing one or more unambiguous words located in
one or more documents that have been previously processed by the computer program. 

4. The computer program product of claim 1, wherein the instruction to use the
dictionary to resolve the ambiguous typesetting placeholders in each ambiguous word, 
comprises instructions operable to cause a programmable processor to: 

create a set of candidate solutions for the ambiguous word, wherein each candidate 
solution comprises one or more character strings created by resolving the one or more 
ambiguous typesetting placeholders in the ambiguous word, and wherein the set of candidate 
solutions comprises all possible combinations of resolutions of the one or more typesetting 
placeholders; 

search the dictionary for the one or more character strings in each candidate solution;
and

use the dictionary search result to resolve the one or more ambiguous typesetting
placeholders in the ambiguous word. 

5. The computer program product of claim 4, wherein the instruction to create a set of 
candidate solutions for an ambiguous word having N binary-resolvable typesetting 
placeholder ambiguities, comprises instructions to create a set of 2^N candidate solutions.
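
The candidate set of claims 4 and 5 can be illustrated with a short Python sketch. It assumes a hypothetical `~` marker for each ambiguous placeholder and two possible resolutions per placeholder; the function name `candidate_solutions` and the tuple layout are illustrative choices, not part of the claims.

```python
from itertools import product

AMBIG = "~"  # hypothetical marker for an ambiguous placeholder (assumption)


def candidate_solutions(word, resolutions=("-", "")):
    """Enumerate every combination of resolutions for the placeholders.

    Each result is (choice, strings): the resolution chosen for each
    placeholder, and the character strings that the choice produces.
    """
    parts = word.split(AMBIG)
    n = len(parts) - 1  # number of ambiguous placeholders in the word
    candidates = []
    for choice in product(resolutions, repeat=n):
        pieces = [parts[0]]
        for separator, part in zip(choice, parts[1:]):
            pieces.append(separator)
            pieces.append(part)
        # A blank-space resolution splits the word into several strings.
        candidates.append((choice, "".join(pieces).split()))
    return candidates


hyphen_candidates = candidate_solutions("a~b~c")
space_candidates = candidate_solutions("the~rapist", resolutions=(" ", ""))
```

With two binary placeholders the sketch yields 2^2 = 4 candidates. Passing `(" ", "")` as the resolutions models a white space that is either a blank space or a kerning space, in which case one candidate holds two character strings.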

6. The computer program product of claim 4, wherein the instruction to use the 
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to resolve the one or more ambiguous 
typesetting placeholders in conformity with the one or more resolutions used to create a 
member of the set of candidate solutions when the dictionary search matches only that 
member of the set of candidate solutions. 

7. The computer program product of claim 4, wherein the instruction to use the 
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to prompt a user to manually resolve the one 
or more ambiguous typesetting placeholders in the ambiguous word when the dictionary 
search fails to match any member of the set of candidate solutions. 

8. The computer program product of claim 4, wherein the instruction to use the 
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to prompt a user to manually resolve the one 
or more ambiguous typesetting placeholders in the ambiguous word when the dictionary 
search matches a plurality of members of the set of candidate solutions. 
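
The decision rule of claims 6 to 8 can be sketched as below. The sketch assumes a candidate matches when every one of its character strings is in the dictionary; the function name `resolve` and the status strings are hypothetical.

```python
def resolve(candidates, dictionary):
    """Apply the dictionary search result to a set of candidate solutions.

    candidates: a list of candidate solutions, each a list of strings.
    Returns ("resolved", candidate) when exactly one candidate matches,
    otherwise ("prompt_user", matches) so the caller can ask the user.
    """
    matches = [c for c in candidates if all(s in dictionary for s in c)]
    if len(matches) == 1:
        return "resolved", matches[0]
    # No match, or several matches: manual resolution is needed.
    return "prompt_user", matches


unique = resolve([["co-operate"], ["cooperate"]], {"cooperate"})
none_found = resolve([["x-y"], ["xy"]], set())
several = resolve([["re-sign"], ["resign"]], {"re-sign", "resign"})
```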

9. The computer program product of claim 4, wherein the instruction to use the 
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to resolve the one or more ambiguous 
typesetting placeholders in conformity with the one or more resolutions used to create the 
candidate solution having the largest word when the dictionary search matches a plurality of 
members of the set of candidate solutions. 

10. The computer program product of claim 9, wherein the instruction to use the
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to resolve the one or more ambiguous 
typesetting placeholders in conformity with the one or more resolutions used to create the 
candidate solution having the fewest words when the dictionary search matches a plurality of 
members of the set of candidate solutions. 

11. The computer program product of claim 4, wherein the instruction to use the
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to resolve the one or more ambiguous 
typesetting placeholders in conformity with the one or more resolutions used to create the 
candidate solution having the smallest word when the dictionary search matches a plurality 
of members of the set of candidate solutions. 

12. The computer program product of claim 11, wherein the instruction to use the
dictionary search result to resolve the one or more ambiguous typesetting placeholders in the 
ambiguous word, further comprises instructions to resolve the one or more ambiguous 
typesetting placeholders in conformity with the one or more resolutions used to create the 
candidate solution having the most words when the dictionary search matches a plurality of 
members of the set of candidate solutions. 
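
The automatic tie-breaking of claims 9 to 12 might be sketched as follows. The claims do not say how the rule of claim 10 combines with that of claim 9, or claim 12 with claim 11. The sketch assumes the dependent rule acts as a secondary tie-break, which is only one possible reading; the function names are hypothetical.

```python
def prefer_largest_word(matches):
    """Pick the match with the largest word, then the fewest words."""
    return max(matches, key=lambda c: (max(len(s) for s in c), -len(c)))


def prefer_smallest_word(matches):
    """Pick the match with the smallest word, then the most words."""
    return min(matches, key=lambda c: (min(len(s) for s in c), -len(c)))


# Each match is one candidate solution: a list of character strings.
matches = [["the", "rapist"], ["therapist"]]
largest = prefer_largest_word(matches)
smallest = prefer_smallest_word(matches)
```

In this example the largest-word rule selects the single string "therapist", while the smallest-word rule selects the two-string candidate.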

13. The computer program product of claim 4, wherein the ambiguous typesetting 
placeholders comprise hyphens resolvable as hard hyphens or soft hyphens. 

14. The computer program product of claim 13, further comprising instructions operable
to cause a programmable processor to output the character code for the correct ambiguity 
resolution. 
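
Outputting a character code for a resolved hyphen could look like the sketch below. It assumes Unicode output, where U+002D is HYPHEN-MINUS and U+00AD is SOFT HYPHEN; the claims do not name an encoding, and the `~` marker and the function name `emit_hyphens` are hypothetical.

```python
AMBIG = "~"  # hypothetical marker for an ambiguous hyphen (assumption)
HARD_HYPHEN = "\u002d"  # HYPHEN-MINUS
SOFT_HYPHEN = "\u00ad"  # SOFT HYPHEN


def emit_hyphens(word, is_hard):
    """Replace each ambiguous hyphen with the character code chosen for it.

    is_hard holds one boolean per ambiguous hyphen, in order of occurrence.
    """
    parts = word.split(AMBIG)
    if len(is_hard) != len(parts) - 1:
        raise ValueError("one decision is needed per ambiguous hyphen")
    out = [parts[0]]
    for hard, part in zip(is_hard, parts[1:]):
        out.append(HARD_HYPHEN if hard else SOFT_HYPHEN)
        out.append(part)
    return "".join(out)


hard_result = emit_hyphens("well~known", [True])
soft_result = emit_hyphens("co~operate", [False])
```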

15. The computer program product of claim 4, wherein the ambiguous typesetting 
placeholders comprise white space between characters resolvable as blank space or kerning 
space. 

16. The computer program product of claim 15, further comprising instructions operable
to cause a programmable processor to add space to an ambiguous white space resolved to be
blank space and to remove space from an ambiguous white space resolved to be kerning
space.

17. A computer program product, stored on a machine-readable medium, comprising
instructions operable to cause a programmable processor to:

search the document for one or more unambiguous words, where unambiguous words
are words that do not contain an ambiguous typesetting placeholder;

automatically add the one or more unambiguous words to a dictionary;

search the document for an ambiguous word, where an ambiguous word is a word
that does contain an ambiguous typesetting placeholder;

create a set of candidate solutions for the ambiguous word, wherein each candidate
solution comprises one or more character strings created by resolving the one or more
ambiguous typesetting placeholders in the ambiguous word, and wherein the set of candidate
solutions comprises all possible combinations of resolutions of the one or more typesetting
placeholders;

search the dictionary for the one or more character strings in each candidate solution
of the ambiguous word;

resolve the one or more ambiguous typesetting placeholders in conformity with the
one or more resolutions used to create a member of the set of candidate solutions when the
dictionary search matches only that member of the set of candidate solutions;

prompt a user to manually resolve the one or more ambiguous typesetting
placeholders when the dictionary search fails to match any member of the set of candidate
solutions; and to

prompt a user to manually resolve the one or more ambiguous typesetting
placeholders when the dictionary search matches a plurality of members of the set of
candidate solutions.

18. A method for resolving an ambiguous word in an electronic document,
comprising:

searching the document for unambiguous words, where unambiguous words are
words that do not contain one or more ambiguous typesetting placeholders; 

automatically adding the unambiguous words to a dictionary; 

searching the document for an ambiguous word, where an ambiguous word is a word 
that contains one or more ambiguous typesetting placeholders; and 

using the dictionary to resolve the ambiguous word by resolving the one or more 
ambiguous typesetting placeholders occurring in the word. 

19. The method of claim 18, wherein the step of using the dictionary to resolve the
ambiguous word by resolving the one or more ambiguous typesetting placeholders, further 
comprises: 

creating a set of candidate solutions for each ambiguous word, wherein each 
candidate solution comprises one or more character strings created by resolving the one or 
more ambiguous typesetting placeholders in the ambiguous word, and wherein the set of 
candidate solutions comprises all possible typesetting placeholder resolution combinations; 

searching the dictionary for the one or more character strings in each candidate 
solution; and 

using the dictionary search to resolve the one or more ambiguous typesetting 
placeholders in the ambiguous word. 

20. The method of claim 19, further comprising resolving the one or more ambiguous
typesetting placeholders in conformity with the one or more resolutions used to create a 
member of the set of candidate solutions when the dictionary search only matches that 
member of the set of candidate solutions. 

21. The method of claim 20, further comprising prompting a user to manually resolve the 
one or more ambiguous typesetting placeholders when the dictionary search fails to match 
any member of the set of candidate solutions. 

22. The method of claim 20, further comprising prompting a user to manually resolve the 
one or more ambiguous typesetting placeholders when the dictionary search matches a 
plurality of members of the set of candidate solutions. 

23. The method of claim 20, further comprising resolving the one or more ambiguous
typesetting placeholders in conformity with the one or more resolutions used to create the
candidate solution having the largest word when the dictionary search matches a plurality of
members of the set of candidate solutions.

24. The method of claim 20, further comprising resolving the one or more ambiguous
typesetting placeholders in conformity with the one or more resolutions used to create the
candidate solution having the smallest word when the dictionary search matches a plurality
of members of the set of candidate solutions.

25. The method of claim 20, wherein the ambiguous typesetting placeholders comprise
ambiguous hyphens resolvable into hard hyphens or soft hyphens, further comprising
outputting the character code for the correct ambiguity resolution.

26. The method of claim 20, wherein the ambiguous typesetting placeholder comprises an
ambiguous white space between characters resolvable to a blank space or a kerning space,
further comprising adding space to an ambiguous white space resolved to be blank space and
removing space from an ambiguous white space resolved to be kerning space.